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Abstract. - A model for the evolution of a finite population in a rugged fitness landscape is 
introduced and solved. The population is trapped in an evolutionary loop, alternating periods 
of stasis to periods in which it performs adaptive walks. The dependence of the average rarity 
of the population (a quantity related to the fitness of the most adapted individual) and of the 
duration of stases on population size and mutation rate is calculated. 



The simplest conceivable evolutionary situation is a population of asexually reproducing indi- 
viduals set in a fixed environment. The reproductive power of an individual is measured by 
its fitness, i.e., by a quantity proportional to the expected number of its off'spring ^j. In 
model building, one often assumes that the fitness is determined by the genotype, and that 
the genotype itself is transmitted identically from parent to offspring, apart from mutations. 

The role of mutations is favorable if the overall fitness of the members of the population is 
low, because they allow the population to find genotypes with higher fitness, i.e., to adapt. 
This can be represented, following Sewall Wright by saying that the population approaches 
a fitness peak. On the other hand, mutations become pernicious when the population is located 
on such a peak, since they may let the population lose contact with it. In fact, the two effects 
have different relevance depending on population size: if the population size is large, and the 
fitness small, adaptation dominates; but if the population is small, and the fitness peak, no 
matter how lofty, is narrow, mutations have a negative effect. 

This situation has been described by two classes of models: 

1. In the quasispecies model [Q one takes the infinite population size limit from the outset, 
obtaining an equation (akin to a Master Equation) for the genotype distribution in the 
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population. It is interesting that, if the fitness peaks are narrow enough, this equation 
exhibits a transition (the error threshold) between an adaptive regime, and a regime in 
which adaptation is irrelevant. Nevertheless, the description of the non- adaptive regime 
is not satisfactory within this class of models. 

2. The adaptation process has been described as a special kind of random walk, the adaptive 
walk, by Kauffman and Levine Q and others. In this model, the fitness can only increase, 
and mutations only have positive effects. An "annealed" version of this model has been 
exactly solved by Flyvbjerg and Lautrup 

The stochastic escape of a finite population from a narrow fitness peak has been discussed by 
Higgs and Woodcock They find that, in the same limit in which the error threshold appears 
in its fullest glory in the quasispecies model, a finite population eventually loses contact with 
the adaptation peak. Building on this observation and on numerical simulations. Woodcock 
and Higgs |^ have been led to describe the behavior of a population evolving in a rugged fitness 
landscape (i.e., in a situation where even slight changes of the genotype lead to arbitrarily 
large changes in the fitness) as an evolutionary loop: 

1. If the fitness of the population is low, favorable mutations get fixed in the population, 
which thus performs an adaptive walk, reaching a local fitness peak. 

2. The population can be evicted from the adaptation peak by stochastic escape, and start 
a new adaptive walk from a random, usually low, fitness value. 

In this Letter, I introduce a solvable model that exhibits such a behavior. The model is a 
slight generalization of the Annealed Adaptive Walk Model introduced and solved by Flyvbjerg 
and Lautrup and allows for stochastic escape. It depends on only two parameters, namely 
population size and mutation rate. I argue that it should describe any mutation-selection 
model in the strong selection limit, i.e., when the fitness distribution is broad, provided that 
the correct variables are identified and the correct scaling of the parameters is performed. This 
conclusion is borne out by simulations of a slightly more realistic model which I report at the 
end of the Letter. 

I consider a population of M individuals (M is fixed) evolving in a rugged fitness landscape. 
At each generation, for each member a of the new population, its parent a' is chosen among 
the old population. The probability W-y that the individual 7 is chosen to reproduce is given 

by 

= (1) 

where Fj is the fitness of the individual 7. The genotype of a is taken to be equal to that of 
its parent a' apart from mutations. I denote by u the probability that the offspring undergoes 
at least one mutation. 

I make the following simplifications: 

Rugged fitness landscape: The fitnesses F of different genotypes are independent random 
variables, identically distributed according to some distribution p{F); 

Infinite genome Umit: On each mutation, a wholly new genotype appears in the population; 



Strong selection Umit: At each generation, only the fittest individuals are allowed to re- 
produce. 
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Given a fitness distribution p{F), one can define the rarity h of a fitness value F as the 
probability that, picking a new fitness at random with the distribution p{F) one obtains a 
higher fitness value 

h{F)= / dF'p{F'). (2) 

Note that higher fitness corresponds to smaller rarity. Within our hypotheses, it is easy to see 
that the genetic properties of the whole population are summarized by the rarity of its fittest 
individual, which we shall simply call the rarity of the population. Consider the population 
at generation t, and let h(t) be its rarity. For each individual of the new generation, decide 
(with probability u) if it mutates. If it does, assign to it a rarity ha with uniform probability 
between and 1. Otherwise, assign to it a rarity ha equal to h. The rarity of the population 
is then h{t + 1) = mhia ha- Therefore, the evolution of an ensemble of populations is described 
by the probability distribution function P{h,t) of their rarity. 

In the following I derive the evolution equation for P{h, t) and solve for its asymptotic form 
in the limit of large, but finite populations. 

Let us denote by vr^^ the probability that, at any one generation, exactly v individuals 
mutate: 



_Af I \ \M—ii 



u^{l~ut'-\ (3) 



and by (jiuih) the probability that, picking up v genotypes at random, one obtains rarities all 
larger than h: 

Mh) = {l-hr. (4) 
It is then easy to see that P{h, t) satisfies the following equation: 



P{h,t+1) = n^'P{h,t)-4^cl>'M{h) 

M-l 



(f)^{h)P{h,t) - (t>'^{h) dh'P{h',t) 



h 



(5) 



The first term represents the case in which there are no mutations, and the second one the 
case in which all individuals mutate, and the rarity of the new population is equal to h. The 
sum represents the case in which some individuals mutate, and some do not: the first term in 
square brackets describes the case where the rarity of all mutated individuals exceeds h, and 
the second one the case in which the rarity of at least one mutated individual is equal to h 
and smaller than the preceding rarity of the population. Introducing the notations 

<^{h,t) = f dh'P{h',t), (6) 



Jh 



g{h) = -nWMih)^u^'Mil~h)''-\ (8) 



eq. (|^) can be written 



$'(/i, t + 1) = f{h)^'{h, t) + /'(/i)$(/i, t) - g{h) = J- [/(/i)$(/i, t)] - g{h), (9) 
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where the primes denote derivatives with respect to h. The solution of this equation is 



Let us consider large values of M and set 

M 



In this limit, one has 



Jh 



f{h) 
dh'g{h') 



1 - m 



1 ^ 
M 



g-fe _ g-(fe+«'). 



■ / d/i'5(/i'). 

Jh 



-(fe+iu) 



We obtain therefore 



^{k,t)= lim $(fc/M,t) = e-'=*(l-e-^)**(A;,0) + 



l-e-''*(l-e-"')* 

Therefore, the f — > oo limit of '^{k, t), denoted by ^oo{k), is given by 

*oo(A:) ^ 



-(k+w) 



gi«(gfc _ 1) + l" 

One has 

*oo(0) = l; *oo(fc)~e-'= for A; ^ oo. 
It can be checked that the average value of k is given by 

{k} = -l^^dk'^Uk')k' = ^^, 



(10) 

(11) 

(12) 
(13) 

(14) 

(15) 
(16) 
(17) 



and varies from (fc) = 1 to (fc) = as w varies between and infinity. 

Let us define stasis the regime in which the rarity h of the population remains constant or 
decreases, and stochastic escape the event by which it increases, since a harmful mutation gets 
fixed in the population. We can then compute the distribution of stasis duration as follows. 
Denote by K{k,t) the conditional probability that, if the minimal rarity was equal to k/M at 
generation t = 0, it has never decreased up to generation t. This function obeys the following 
equation: 



K{k,t+1) = [l-7r^+7r^cl>M{k/M)]K{k,t) 



K{k,t), 



from which we deduce 

K{k,t) = h -e-('=+"') 
The average duration of a stasis is therefore 

{t) = 6^+"" - 1. 

If we take as a "typical" value of k that for which ^^{k) = i, we obtain {t) = . 



(18) 



(19) 



(20) 
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I now consider a population in which the probabihty Wa that an individual a reproduces is 
given by eq. (|l|). For the strong selection limit to hold, the probability that individuals with 
less than the maximum fitness reproduce should be negligible. This implies in turn that the 
fitness distribution function p{F) should be very broad. I have therefore assumed that the 
fitnesses are extracted from a Levy distribution p{F) = sF~*~^ for 1 < F < oo, which, for 
< s < 1, has an infinite second moment. The curves of (fc) vs. w should be independent of 
s. The results of the simulation this model are shown in Fig. ^ One notices that the points 
tend to lie above the theoretical curve for larger values of w. In fact, the deviations from the 
strong selection limit become significant in this regime. 




0.5 1 1.5 



Fig. 1. - Average scaled rarity (k) = M (h) over 250 000 generations as a function of the scaled 
mutation rate w = M(l - u). O: M = 32, s = 0.1; +: M = 32, s = 0.2; □: M = 128, s = 0.1; x: 
M = 512, s — 0.2; A: M — 512, s — 0.4. Root mean square deviations vary between 0.04 for w ~ 0.1 
to 0.006 for lu ~ 4. The dotted line corresponds to eq. (0). 



Indeed, the probability that the ratio of the largest to the second largest fitness value, 
extracted from a Levy distribution of parameter s, exceeds A, is given by A"'*, independently 
of the number of extracted values. The strong selection limit holds therefore only if among the 
individuals that reproduce without mutations (on average w — {I ~ u)M at each generation), 
the chance that the second fittest one reproduces is negligible. This happens if the probability 
that this ratio is smaller than w is itself small, say of order e. This implies in turn w~'^ > 1 — e, 
or, for small values of s, w < e'^^^. It may easily be checked that this is indeed the case. 

I am afraid that it would be difficult to observe stochastic escape in all but the smallest 
populations, although the analogous effect in smooth fitness landscapes (namely MuUer's 
ratchet |^) has been observed during repeated genetic bottleneck transfers of monoclonal 
antibody-resistant mutants of vesicular stomatitis virus, or of mutants of an RNA bacterio- 
phage ^ . I hope nevertheless that this approach provides at least a better grasp of the 
evolutionary behavior of a finite population in a rugged fitness landscape, and may be used as 
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a starting point for the study of more realistic situations. 
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